Serveur d'exploration sur la recherche en informatique en Lorraine

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Estimation de l’inclinaison d’un document arabe manuscrit numérisé par analyse temps-fréquence des histogrammes de projection

Identifieur interne : 003855 ( Main/Exploration ); précédent : 003854; suivant : 003856

Estimation de l’inclinaison d’un document arabe manuscrit numérisé par analyse temps-fréquence des histogrammes de projection

Auteurs : Nazih Ouwayed [France] ; Abdel Belaïd [France] ; François Auger [France]

Source :

RBID : ISTEX:EF5423881B2B9CCBB2DF9D356819B7204312C82F

Descripteurs français

English descriptors

Abstract

Nous présentons dans cet article une nouvelle méthode de détermination de l'inclinaison d'un document manuscrit arabe à l'aide d'une représentation temps-fréquence énergétique de la classe de Cohen. Cette méthode consiste à calculer d'abord les histogrammes de projection obtenus pour différents angles, puis à déterminer la valeur maximale de la représentation temps-fréquence de la racine carrée de ces histogrammes. L'orientation du document est alors estimée par l'angle de projection fournissant la valeur maximale la plus élevée. La méthode proposée a été testée sur 864 documents inclinés avec 9 représentations temps-fréquence différentes. Les résultats sont présentés et analysés à la fin de cet article.
Ancient Arabic textual archives contain a heavy volume of handwritten documents that need to be scanned and indexed. Some of these documents are skewed, making their recognition and indexing difficult because straight lines are more suitable for the word extraction by recognition systems. We are looking for a method that can robustly estimate this orientation, whatever the size of the document. The scientific literature already proposes some solutions for image document skew angle estimation. The projection techniques seem the most appropriate ones but need to be adapted to Arabic documents. In fact, in Arabic script, the words are made of PAWs (Parts of Arabic Words) which are almost vertical or oblique and which may distort the calculation of local orientation. This prevents to apply local techniques like nearest neighbors, because of the alignment irregularity, or global techniques such as the Hough Transform because of the difficulty of locating voting points. Although these techniques fit well to printed documents, they remain inadequate to handwritten documents, in which the interline distance is random and the skew angle can be large. Kavallieratou et al. employed Cohen's class distributions on Latin documents. This Cohen's class contains all the quadratic time-frequency distributions that are covariant under time- and frequency-shifts. The members of this class are identified by a particular kernel ϕdD (T,ξ) which determines their theoretical properties and their practical readability. In Kavallieratou's paper, the relationship between the distributions properties and the experimental results are not highlighted. We propose in this article to look for the most relevant properties related to the skew angle estimation problem and to find, thanks to them, the best distribution to use. To estimate the orientation angle, we propose to compute a time-frequency representation of the analytic signal xa(t) the centered squared root of the projection histogram x(t) of the document. The projection angle corresponding to the histogram with the highest maximum value of its time-frequency representation is considered as an estimation of the document orientation. To study the effectiveness of our approach, we have experimented it on 864 Arabic handwritten documents. These documents have different sizes, contain several types of writing, layout (with 1 or 2 columns), a mix of text and tables, etc. The experiments were prepared after a manual orientation of the documents into different angles ranging from −75° to +90°. We found that the Wigner-Ville distribution reaches the highest estimation rate (100%). The other distributions yield a lower estimation rate, either because they do not satisfy properties that are important for the skew angle estimation problem, such as the scale invariance property and the support conservation, because their localization of the signal components is not sufficiently precise to provide a skew angle estimation with the maximum of the representation, or because the parameters of these distributions are not fitted to the analysed histogram profiles. The skew angle estimator using the Wigner-Ville distribution is also compared to the projection analysis and Fourier Transform methods.

Url:
DOI: 10.3166/ts.26.307-319


Affiliations:


Links toward previous steps (curation, corpus...)


Le document en format XML

<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="fr">Estimation de l’inclinaison d’un document arabe manuscrit numérisé par analyse temps-fréquence des histogrammes de projection</title>
<author>
<name sortKey="Ouwayed, Nazih" sort="Ouwayed, Nazih" uniqKey="Ouwayed N" first="Nazih" last="Ouwayed">Nazih Ouwayed</name>
</author>
<author>
<name sortKey="Belaid, Abdel" sort="Belaid, Abdel" uniqKey="Belaid A" first="Abdel" last="Belaid">Abdel Belaïd</name>
<affiliation>
<country>France</country>
<placeName>
<settlement type="city">Nancy</settlement>
<region type="region" nuts="2">Grand Est</region>
<region type="region" nuts="2">Lorraine (région)</region>
</placeName>
<orgName type="laboratoire" n="5">Laboratoire lorrain de recherche en informatique et ses applications</orgName>
<orgName type="university">Université de Lorraine</orgName>
<orgName type="institution">Centre national de la recherche scientifique</orgName>
<orgName type="institution">Institut national de recherche en informatique et en automatique</orgName>
</affiliation>
</author>
<author>
<name sortKey="Auger, Francois" sort="Auger, Francois" uniqKey="Auger F" first="François" last="Auger">François Auger</name>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:EF5423881B2B9CCBB2DF9D356819B7204312C82F</idno>
<date when="2009" year="2009">2009</date>
<idno type="doi">10.3166/ts.26.307-319</idno>
<idno type="url">https://api.istex.fr/ark:/67375/HT0-0SFVXB8L-4/fulltext.pdf</idno>
<idno type="wicri:Area/Istex/Corpus">003929</idno>
<idno type="wicri:explorRef" wicri:stream="Istex" wicri:step="Corpus" wicri:corpus="ISTEX">003929</idno>
<idno type="wicri:Area/Istex/Curation">003886</idno>
<idno type="wicri:Area/Istex/Checkpoint">000961</idno>
<idno type="wicri:explorRef" wicri:stream="Istex" wicri:step="Checkpoint">000961</idno>
<idno type="wicri:doubleKey">0765-0019:2009:Ouwayed N:estimation:de:l</idno>
<idno type="wicri:Area/Main/Merge">003933</idno>
<idno type="wicri:source">HAL</idno>
<idno type="RBID">Hal:inria-00435471</idno>
<idno type="url">https://hal.inria.fr/inria-00435471</idno>
<idno type="wicri:Area/Hal/Corpus">005B46</idno>
<idno type="wicri:Area/Hal/Curation">005B46</idno>
<idno type="wicri:Area/Hal/Checkpoint">002934</idno>
<idno type="wicri:explorRef" wicri:stream="Hal" wicri:step="Checkpoint">002934</idno>
<idno type="wicri:doubleKey">0765-0019:2009:Ouwayed N:estimation:de:l</idno>
<idno type="wicri:Area/Main/Merge">003390</idno>
<idno type="wicri:source">INIST</idno>
<idno type="RBID">Pascal:10-0066774</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000236</idno>
<idno type="wicri:Area/PascalFrancis/Curation">000782</idno>
<idno type="wicri:Area/PascalFrancis/Checkpoint">000210</idno>
<idno type="wicri:explorRef" wicri:stream="PascalFrancis" wicri:step="Checkpoint">000210</idno>
<idno type="wicri:doubleKey">0765-0019:2009:Ouwayed N:estimation:de:l</idno>
<idno type="wicri:Area/Main/Merge">003C44</idno>
<idno type="wicri:Area/Main/Curation">003855</idno>
<idno type="wicri:Area/Main/Exploration">003855</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a" type="main" xml:lang="fr">Estimation de l’inclinaison d’un document arabe manuscrit numérisé par analyse temps-fréquence des histogrammes de projection</title>
<author>
<name sortKey="Ouwayed, Nazih" sort="Ouwayed, Nazih" uniqKey="Ouwayed N" first="Nazih" last="Ouwayed">Nazih Ouwayed</name>
<affiliation wicri:level="4">
<country xml:lang="fr">France</country>
<wicri:regionArea>Université Nancy 2, LORIA, équipe READ, Vandœuvre-Lès-Nancy</wicri:regionArea>
<placeName>
<region type="region" nuts="2">Grand Est</region>
<region type="old region" nuts="2">Lorraine (région)</region>
<settlement type="city">Vandœuvre-lès-Nancy</settlement>
<settlement type="city" wicri:auto="agglo">Nancy</settlement>
</placeName>
<orgName type="university">Université Nancy 2</orgName>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">France</country>
</affiliation>
</author>
<author>
<name sortKey="Belaid, Abdel" sort="Belaid, Abdel" uniqKey="Belaid A" first="Abdel" last="Belaid">Abdel Belaïd</name>
<affiliation wicri:level="4">
<country xml:lang="fr">France</country>
<wicri:regionArea>Université Nancy 2, LORIA, équipe READ, Vandœuvre-Lès-Nancy</wicri:regionArea>
<placeName>
<region type="region" nuts="2">Grand Est</region>
<region type="old region" nuts="2">Lorraine (région)</region>
<settlement type="city">Vandœuvre-lès-Nancy</settlement>
<settlement type="city" wicri:auto="agglo">Nancy</settlement>
</placeName>
<orgName type="university">Université Nancy 2</orgName>
<placeName>
<settlement type="city">Nancy</settlement>
<region type="region" nuts="2">Grand Est</region>
<region type="region" nuts="2">Lorraine (région)</region>
</placeName>
<orgName type="laboratoire" n="5">Laboratoire lorrain de recherche en informatique et ses applications</orgName>
<orgName type="university">Université de Lorraine</orgName>
<orgName type="institution">Centre national de la recherche scientifique</orgName>
<orgName type="institution">Institut national de recherche en informatique et en automatique</orgName>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">France</country>
<placeName>
<settlement type="city">Nancy</settlement>
<region type="region" nuts="2">Grand Est</region>
<region type="region" nuts="2">Lorraine (région)</region>
</placeName>
<orgName type="laboratoire" n="5">Laboratoire lorrain de recherche en informatique et ses applications</orgName>
<orgName type="university">Université de Lorraine</orgName>
<orgName type="institution">Centre national de la recherche scientifique</orgName>
<orgName type="institution">Institut national de recherche en informatique et en automatique</orgName>
</affiliation>
</author>
<author>
<name sortKey="Auger, Francois" sort="Auger, Francois" uniqKey="Auger F" first="François" last="Auger">François Auger</name>
<affiliation wicri:level="4">
<country xml:lang="fr">France</country>
<wicri:regionArea>Université de Nantes, IREENA site de, Saint-Nazaire</wicri:regionArea>
<placeName>
<region type="region">Pays de la Loire</region>
<region type="old region">Pays de la Loire</region>
<settlement type="city">Saint-Nazaire</settlement>
</placeName>
<orgName type="university">Université de Nantes</orgName>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">France</country>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series>
<title level="j" type="main">Traitement du Signal</title>
<title level="j" type="abbrev">Trait. Signal</title>
<idno type="ISSN">0765-0019</idno>
<idno type="eISSN">1958-5608</idno>
<imprint>
<publisher>Lavoisier</publisher>
<date type="published" when="2009-07">2009</date>
<biblScope unit="vol">26</biblScope>
<biblScope unit="issue">4</biblScope>
<biblScope unit="page" from="307">307</biblScope>
<biblScope unit="page" to="319">319</biblScope>
<biblScope unit="page-count">13</biblScope>
<biblScope unit="ref-count">0</biblScope>
<biblScope unit="fig-count">0</biblScope>
<biblScope unit="table-count">0</biblScope>
</imprint>
<idno type="ISSN">0765-0019</idno>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt>
<idno type="ISSN">0765-0019</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Arabic</term>
<term>Asymmetry</term>
<term>Digitizing</term>
<term>Energy distribution</term>
<term>Histogram</term>
<term>Manuscript document</term>
<term>Parameter estimation</term>
<term>Projection method</term>
<term>Text analysis</term>
<term>Tilt angle</term>
<term>Time-frequency analysis</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr">
<term>Analyse fréquence temps</term>
<term>Analyse texte</term>
<term>Angle inclinaison</term>
<term>Arabe</term>
<term>Asymétrie</term>
<term>Distribution énergie</term>
<term>Document manuscrit</term>
<term>Estimation paramètre</term>
<term>Histogramme</term>
<term>Méthode projection</term>
<term>Numérisation</term>
</keywords>
<keywords scheme="Wicri" type="topic" xml:lang="fr">
<term>Numérisation</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="fr">Nous présentons dans cet article une nouvelle méthode de détermination de l'inclinaison d'un document manuscrit arabe à l'aide d'une représentation temps-fréquence énergétique de la classe de Cohen. Cette méthode consiste à calculer d'abord les histogrammes de projection obtenus pour différents angles, puis à déterminer la valeur maximale de la représentation temps-fréquence de la racine carrée de ces histogrammes. L'orientation du document est alors estimée par l'angle de projection fournissant la valeur maximale la plus élevée. La méthode proposée a été testée sur 864 documents inclinés avec 9 représentations temps-fréquence différentes. Les résultats sont présentés et analysés à la fin de cet article.</div>
<div type="abstract" xml:lang="en">Ancient Arabic textual archives contain a heavy volume of handwritten documents that need to be scanned and indexed. Some of these documents are skewed, making their recognition and indexing difficult because straight lines are more suitable for the word extraction by recognition systems. We are looking for a method that can robustly estimate this orientation, whatever the size of the document. The scientific literature already proposes some solutions for image document skew angle estimation. The projection techniques seem the most appropriate ones but need to be adapted to Arabic documents. In fact, in Arabic script, the words are made of PAWs (Parts of Arabic Words) which are almost vertical or oblique and which may distort the calculation of local orientation. This prevents to apply local techniques like nearest neighbors, because of the alignment irregularity, or global techniques such as the Hough Transform because of the difficulty of locating voting points. Although these techniques fit well to printed documents, they remain inadequate to handwritten documents, in which the interline distance is random and the skew angle can be large. Kavallieratou et al. employed Cohen's class distributions on Latin documents. This Cohen's class contains all the quadratic time-frequency distributions that are covariant under time- and frequency-shifts. The members of this class are identified by a particular kernel ϕdD (T,ξ) which determines their theoretical properties and their practical readability. In Kavallieratou's paper, the relationship between the distributions properties and the experimental results are not highlighted. We propose in this article to look for the most relevant properties related to the skew angle estimation problem and to find, thanks to them, the best distribution to use. To estimate the orientation angle, we propose to compute a time-frequency representation of the analytic signal xa(t) the centered squared root of the projection histogram x(t) of the document. The projection angle corresponding to the histogram with the highest maximum value of its time-frequency representation is considered as an estimation of the document orientation. To study the effectiveness of our approach, we have experimented it on 864 Arabic handwritten documents. These documents have different sizes, contain several types of writing, layout (with 1 or 2 columns), a mix of text and tables, etc. The experiments were prepared after a manual orientation of the documents into different angles ranging from −75° to +90°. We found that the Wigner-Ville distribution reaches the highest estimation rate (100%). The other distributions yield a lower estimation rate, either because they do not satisfy properties that are important for the skew angle estimation problem, such as the scale invariance property and the support conservation, because their localization of the signal components is not sufficiently precise to provide a skew angle estimation with the maximum of the representation, or because the parameters of these distributions are not fitted to the analysed histogram profiles. The skew angle estimator using the Wigner-Ville distribution is also compared to the projection analysis and Fourier Transform methods.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>France</li>
</country>
<region>
<li>Grand Est</li>
<li>Lorraine (région)</li>
<li>Pays de la Loire</li>
</region>
<settlement>
<li>Nancy</li>
<li>Saint-Nazaire</li>
<li>Vandœuvre-lès-Nancy</li>
</settlement>
<orgName>
<li>Centre national de la recherche scientifique</li>
<li>Institut national de recherche en informatique et en automatique</li>
<li>Laboratoire lorrain de recherche en informatique et ses applications</li>
<li>Université Nancy 2</li>
<li>Université de Lorraine</li>
<li>Université de Nantes</li>
</orgName>
</list>
<tree>
<country name="France">
<region name="Grand Est">
<name sortKey="Ouwayed, Nazih" sort="Ouwayed, Nazih" uniqKey="Ouwayed N" first="Nazih" last="Ouwayed">Nazih Ouwayed</name>
</region>
<name sortKey="Auger, Francois" sort="Auger, Francois" uniqKey="Auger F" first="François" last="Auger">François Auger</name>
<name sortKey="Auger, Francois" sort="Auger, Francois" uniqKey="Auger F" first="François" last="Auger">François Auger</name>
<name sortKey="Belaid, Abdel" sort="Belaid, Abdel" uniqKey="Belaid A" first="Abdel" last="Belaid">Abdel Belaïd</name>
<name sortKey="Belaid, Abdel" sort="Belaid, Abdel" uniqKey="Belaid A" first="Abdel" last="Belaid">Abdel Belaïd</name>
<name sortKey="Ouwayed, Nazih" sort="Ouwayed, Nazih" uniqKey="Ouwayed N" first="Nazih" last="Ouwayed">Nazih Ouwayed</name>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Wicri/Lorraine/explor/InforLorV4/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 003855 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 003855 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Wicri/Lorraine
   |area=    InforLorV4
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     ISTEX:EF5423881B2B9CCBB2DF9D356819B7204312C82F
   |texte=   Estimation de l’inclinaison d’un document arabe manuscrit numérisé par analyse temps-fréquence des histogrammes de projection
}}

Wicri

This area was generated with Dilib version V0.6.33.
Data generation: Mon Jun 10 21:56:28 2019. Site generation: Fri Feb 25 15:29:27 2022